Emergence of a mutual-growth mechanism in networks evolved by social preference based on indirect utility

Preferential attachment is an important mechanism in the structural evolution of complex networks. However, though resources on a network propagate and have an effect beyond a direct relationship, growth by preferential attachment based on indirectly propagated resources has not been systematically investigated. Here, we propose a mathematical model of an evolving network in which preference is proportional to a utility function reflecting direct utility from directly connected nodes and indirect utility from indirectly connected nodes beyond the directly connected nodes. Our analysis showed that preferential attachment involving indirect utility forms a converged and hierarchical structure, thereby significantly increasing the indirect utility across the entire network. Further, we found that the structures are formed by mutual growth between adjacent nodes, which promotes a scaling exponent of 1.5 between the number of indirect and direct links. Lastly, by examining several real networks, we found evidence of mutual growth, especially in social networks. Our findings demonstrate a growth mechanism emerging in evolving networks with preference for indirect utility, and provide a foundation for systematically investigating the role of preference for indirect utility in the structural and functional evolution of large-scale social networks.

theoretically the structural characteristics of networks that social preferences based on indirect links form, and little is known about the social effects of such network structures.
In this study, we investigate the mechanism of structural formation of an evolving network that includes indirect resources as social preferences, and its expected social effects. For this purpose, we propose a new mathematical model of evolving networks with preferential attachment based on utility. Utility is the physical, economic, and psychological effect obtained from various resources, and is usually described as a function consisting of cost and benefit terms [30,31]. From the perspective of a network, utility can be obtained from the nodes reachable on the network. We introduce a utility function that considers utility not only from directly connected nodes but also from indirectly connected nodes beyond them [32-35]. In this utility function, the benefit that a node obtains from another node is designed according to the degree of separation between them, and, in the case of direct connections, an additional cost term accounts for the establishment and maintenance of the relationship (see Methods for details).
In what follows, we first present our analysis of the temporal networks that evolve under a utility-based preferential attachment model in various conditions, obtained by adjusting the cost and benefit terms of the utility function. From a macroscopic perspective, our analysis takes into account the structural characteristics of a network and the growth of utility in the entire network; microscopically, we take into account the growth patterns of the direct and indirect links around an individual node through mathematical analysis and numerical simulations. Furthermore, our investigation of the growth patterns of direct and indirect links around individual nodes in real networks, especially social networks, confirms that the growth approximates the estimate of the preferential attachment model based on indirect utility.
Our results suggest that the utility-based preferential attachment model can provide a foundation for understanding the structural and functional evolution of large-scale social networks. In particular, we show that preferential attachment based on indirect utility stably forms a converged and hierarchical structure. This stability provides robust evidence of emerging mutual growth among adjacent nodes in social networks.

Illustrative explanation of the utility-based preferential attachment model
Our analysis of the utility-based preferential attachment model takes into consideration the utility within the 2nd degree of separation. The illustration in Fig. 1 serves to explain the preferential attachment of this model. A new node $j$ connects to an existing node $i$ with the preferential attachment probability $\Pi^{[2]}_i(t_j)$ at time $t_j$ ($> t_i$). $\Pi^{[2]}_i(t)$ is proportional to the utility $u^{[2]}_i(t) = k^{[1]}_i(t)(b_1 - c) + k^{[2]}_i(t) b_2$, where $k^{[l]}_i(t)$ ($l = 1, 2$) is the number of links at the $l$th degree of separation from node $i$, $b_l$ ($l = 1, 2$) is a nonnegative-valued function representing the benefit that node $i$ obtains from a node within the $l$th degree of separation, and $c$ is a positive-valued parameter representing the cost for node $i$ to connect with a node within the 1st degree of separation. Notably, node $i$ incurs no cost for establishing and maintaining indirect links.
Figure 1. Preferential attachment of the utility-based preferential attachment model. A new node $j$ connects to an existing node $i$ with the preferential attachment probability $\Pi^{[2]}_i(t_j)$ at time $t_j$ ($> t_i$). $\Pi^{[2]}_i(t)$ is proportional to the utility $u^{[2]}_i(t) = k^{[1]}_i(t)(b_1 - c) + k^{[2]}_i(t) b_2$, where $k^{[l]}_i(t)$ ($l = 1, 2$) is the number of links at the $l$th degree of separation from node $i$, and $b_l$ ($l = 1, 2$) and $c$ are parameters that weight the benefits and costs, respectively (see the "Methods" section). In this example, node $i$ is directly connected to nodes $i_1$ and $i_2$, i.e., $k^{[1]}_i(t_j) = 2$. Therefore, node $i$ must pay $2c$ as the cost of the direct links to obtain $2b_1$ from nodes $i_1$ and $i_2$ as the direct benefit. Thus, the direct utility is $2(b_1 - c)$. Node $i$ also obtains $4b_2$ as the indirect benefit because it connects indirectly to four nodes, $i_1^1$, $i_1^2$, $i_2^1$, and $i_2^2$, i.e., $k^{[2]}_i(t_j) = 4$. Thus, the indirect utility is $4b_2$.
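The arithmetic of the Fig. 1 example can be checked with a minimal Python sketch. The function names, the node labels, and the numerical values of `b1`, `b2`, and `c` are illustrative assumptions and do not come from the paper; only the formula $k^{[1]}_i(b_1 - c) + k^{[2]}_i b_2$ is taken from the text.

```python
from collections import defaultdict


def link_counts(adj, i):
    """Return (k1, k2): the numbers of nodes at distance 1 and exactly 2 from i."""
    first = set(adj[i])
    second = set()
    for n in first:
        second.update(adj[n])
    second -= first        # drop nodes already counted as direct links
    second.discard(i)      # drop the node itself
    return len(first), len(second)


def utility(adj, i, b1, b2, c):
    """Utility within two degrees of separation: k1 * (b1 - c) + k2 * b2."""
    k1, k2 = link_counts(adj, i)
    return k1 * (b1 - c) + k2 * b2


# Network of the Fig. 1 example (labels are hypothetical).
edges = [("i", "i1"), ("i", "i2"),
         ("i1", "i1a"), ("i1", "i1b"),
         ("i2", "i2a"), ("i2", "i2b")]
adj = defaultdict(set)
for a, b in edges:
    adj[a].add(b)
    adj[b].add(a)

k1, k2 = link_counts(adj, "i")                 # two direct, four indirect links
u = utility(adj, "i", b1=3.0, b2=1.0, c=2.0)   # 2 * (3 - 2) + 4 * 1
```

With these assumed parameter values, node `i` has a direct utility of 2 and an indirect utility of 4, so indirect links dominate its preference even though they incur no cost.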
www.nature.com/scientificreports/
To investigate the effect of indirect utility on the structure and growth of evolving networks, we compared four special cases of the utility-based preferential attachment model, the $U_D$, $U_I$, $U_{M+}$, and $U_{M-}$ models, with a random-attachment model, $U_R$. Every model uses a conventional growth rule in which a new node joins at each step and connects to an existing node with a model-specific attachment probability. The $U_D$ model takes into consideration only the direct utility, i.e., $\Pi^{[2]}_i(t) \propto k^{[1]}_i(t)$. The $U_I$ model takes into consideration only the indirect utility, i.e., $\Pi^{[2]}_i(t) \propto k^{[2]}_i(t)$. The $U_{M+}$ and $U_{M-}$ models take into consideration both the direct and indirect utilities. The $U_{M+}$ model represents a case in which the direct benefit exceeds the cost ($b_1 = c + 1$, $b_2 = 1$), i.e., $\Pi^{[2]}_i(t) \propto k^{[1]}_i(t) + k^{[2]}_i(t)$, whereas the $U_{M-}$ model represents a case in which the cost exceeds the direct benefit ($b_1 = c - 1$, $b_2 = 1$), i.e., $\Pi^{[2]}_i(t) \propto -k^{[1]}_i(t) + k^{[2]}_i(t)$. The attachment probability of the $U_R$ model is the same for all existing nodes in the network.
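The five growth rules can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. It assumes a seed network of two linked nodes and one link per new node, and it clamps negative weights to zero (which can occur in the $U_{M-}$ case), falling back to uniform attachment when all weights vanish. The paper's handling of these details is given in its Methods section and may differ.

```python
import random

# (coefficient of k1, coefficient of k2) for each model; None means uniform attachment.
MODELS = {"U_R": None, "U_D": (1, 0), "U_I": (0, 1), "U_M+": (1, 1), "U_M-": (-1, 1)}


def grow(model, steps, seed=0):
    """Grow a network by attaching one new node per step; return (adj, k2)."""
    rng = random.Random(seed)
    adj = {0: {1}, 1: {0}}   # assumed seed network: a single link
    k2 = {0: 0, 1: 0}        # number of nodes at distance 2 from each node
    for new in range(2, steps + 2):
        nodes = list(adj)
        coeffs = MODELS[model]
        if coeffs is None:
            weights = [1.0] * len(nodes)
        else:
            a, b = coeffs
            # Assumption: negative utilities are clamped to zero.
            weights = [max(a * len(adj[v]) + b * k2[v], 0.0) for v in nodes]
            if sum(weights) == 0:
                weights = [1.0] * len(nodes)   # assumed fallback: uniform choice
        target = rng.choices(nodes, weights=weights, k=1)[0]
        # The new node reaches every existing neighbour of the target in two steps,
        # and each of those neighbours gains the new node as an indirect link.
        k2[new] = len(adj[target])
        for n in adj[target]:
            k2[n] += 1
        adj[target].add(new)
        adj[new] = {target}
    return adj, k2
```

Because each new node brings a single link, the network stays a tree, so `k2` can be updated incrementally instead of being recomputed by a two-step search at every step.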

The effect of indirect utility on the overall structure and growth of evolving networks
Figure 2 shows the effect of indirect utility on the structure of networks formed by the $U_R$, $U_D$, $U_I$, $U_{M+}$, and $U_{M-}$ models. Figure 2a illustrates the networks formed by the $U_R$, $U_D$, and $U_I$ models at around $t = 5000$. A node with more direct links than the surrounding nodes, hereafter referred to as a "local hub", is marked in red, and other nodes are marked increasingly blue as the distance from a local hub increases. The distance is defined as the number of steps taken to find a local hub by recursively searching the neighbour with the most direct links. In the network formed by the $U_R$ model, all of the nodes grow evenly, while, in the networks formed by the $U_D$ and $U_I$ models, the growth of a hub with a large number of direct links is conspicuous, resulting in a heavy-tailed degree distribution. In particular, the network formed by the $U_I$ model has a converged and hierarchical structure in which many nodes grow sequentially around a very large local hub.
Figure 2. (a) The red colour represents the local hub, which is a node with $k^{[1]}_i$ greater than that of the surrounding nodes; the other nodes are shaded gradually towards blue according to the distance from the nearest local hub. In the $U_I$ model, a converged and hierarchical structure forms in which many nodes grow sequentially around a very large local hub. (b,c) The distribution $P(k^{[1]})$ and the growth of the ratio $k^{[2]}(t)$ ($\equiv \sum_{j=1}^{t} k^{[2]}_j(t) / \sum_{j=1}^{t} k^{[1]}_j(t)$) in the utility-based preferential attachment models. The solid line represents the average of 20 numerical simulations ($t = 100{,}000$), and the error bar denotes the standard deviation.
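The local-hub definition and the hub distance used for colouring can be sketched as follows. The tie-breaking rule (lowest node id among neighbours of equal degree) and the `None` return value when the search revisits a node are assumptions made here so that the search always terminates; the paper does not specify either.

```python
def is_local_hub(adj, v):
    """A local hub has strictly more direct links than each of its neighbours."""
    return all(len(adj[v]) > len(adj[n]) for n in adj[v])


def distance_to_local_hub(adj, v):
    """Steps needed to reach a local hub by repeatedly moving to the
    neighbour with the most direct links (integer node ids assumed)."""
    steps, seen = 0, {v}
    while not is_local_hub(adj, v):
        # Highest degree first; ties broken by the lowest node id (assumption).
        v = max(adj[v], key=lambda n: (len(adj[n]), -n))
        if v in seen:          # search cycles between equal-degree nodes
            return None
        seen.add(v)
        steps += 1
    return steps


# Hypothetical example: a hub (node 0) with a short chain 3-4-5 hanging off it.
example = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0, 4}, 4: {3, 5}, 5: {4}}
distances = [distance_to_local_hub(example, v) for v in range(6)]
```

In this example node 0 is the only local hub, and the chain nodes 3, 4, and 5 lie at distances 1, 2, and 3 from it.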
The degree distributions $P(k^{[1]})$ of the $U_R$ model (blue line in Fig. 2b) and the $U_D$ model (orange line in Fig. 2b) show the exponential and the scale-free property, respectively, which are well-known characteristics of the random attachment model and the degree-based preferential attachment model in evolving networks [3]. Meanwhile, the $U_I$, $U_{M+}$, and $U_{M-}$ models have a bump in the degree distribution (green, red, and purple plots). These models all include indirect utility in the preference, and it is known that including the 2nd degree of separation in the preference results in time-dependent degree distributions and a gradual collapse of their scaling [24]. As the $U_I$ panel of Fig. 2a shows, the bump in the degree distribution is associated with the converged and hierarchical structure, which can be seen to be a common structure formed by indirect utility.
The growth of the indirect links shown in Fig. 2c also confirms the difference between the models. Here, $k^{[2]}(t)$ is defined as $k^{[2]}(t) \equiv \sum_{j=1}^{t} k^{[2]}_j(t) / \sum_{j=1}^{t} k^{[1]}_j(t)$. Because $\sum_{j=1}^{t} k^{[1]}_j(t) \approx 2t$ in each model, the difference among these models results from $\sum_{j=1}^{t} k^{[2]}_j(t)$, which represents the indirect benefits across the overall network. As Fig. 2c shows, the growth pattern of $k^{[2]}(t)$ is distinct for each model. In the $U_R$ model, $k^{[2]}(t)$ asymptotically converges to a constant, and in the $U_D$ model, $k^{[2]}(t)$ is asymptotically proportional to $\ln t$ (see Supplementary Information S1 for mathematical analysis). The growth of $k^{[2]}(t)$ in the $U_I$, $U_{M+}$, and $U_{M-}$ models is lower than that in the $U_D$ model from the early stage until around $t = 5{,}000$. However, since $k^{[2]}(t)$ in these models grows faster afterwards, it becomes larger than in the $U_D$ model. The $U_{M-}$, $U_I$, and $U_{M+}$ models have the same indirect utility coefficient of 1, but their direct utility coefficients are −1, 0, and 1, respectively (see Eqs. (S29), (S33), and (S37)). Interestingly, the value of $k^{[2]}(t)$ increases in the order of the $U_{M-}$, $U_I$, and $U_{M+}$ models as time passes. These differences result from the balance between direct utility and indirect utility in the preference. It is counterintuitive that a higher $k^{[2]}$ is observed in the $U_{M-}$ model, which penalizes the number of direct links $k^{[1]}_i$, since attaching to a node with a high $k^{[1]}_i$ is advantageous for the immediate growth of $k^{[2]}$: the attachment adds $2k^{[1]}_i$ to the total number of indirect links, where $i$ denotes the node to which the new node attaches. Thus, the structure formed by the preference for $k^{[2]}_i$ contributes more to the growth of $k^{[2]}$ than the preference for $k^{[1]}_i$ as the network grows.
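The bookkeeping behind these statements can be made explicit under one assumption that the growth rule suggests but the text does not state outright: every new node brings exactly one link, so the network is a tree and all nodes reached in two steps are distinct. Writing $k_n$ as shorthand (introduced here) for $k^{[1]}_n(t)$, a sketch of the identities is:

```latex
% Each node n is the midpoint of k_n (k_n - 1) ordered pairs of nodes at distance 2.
\sum_{j=1}^{t} k^{[2]}_j(t) = \sum_{n=1}^{t} k_n (k_n - 1),
\qquad
k^{[2]}(t) = \frac{\sum_{n} k_n (k_n - 1)}{\sum_{n} k_n}
           = \frac{\langle k^2 \rangle}{\langle k \rangle} - 1 .

% A new node attaching to node i (degree k_i before the attachment) changes the total by
\Delta \sum_{j} k^{[2]}_j = (k_i + 1)\,k_i - k_i (k_i - 1) = 2 k_i .
```

Under this assumption the ratio is governed by the second moment of the degree distribution. This is consistent with a constant limit for an exponential degree distribution and with logarithmic growth for a scale-free distribution with exponent 3.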

The effect of the indirect utility on the growth of the respective nodes in evolving networks
To understand the rapid growth of $k^{[2]}(t)$ in the $U_I$, $U_{M+}$, and $U_{M-}$ models at a late stage, it is necessary to consider the growth mechanism of the network formed by the indirect utility that the three models have in common. Figure 3a shows a possible growth mechanism of the network formed by the $U_I$ model, in which the preferential attachment probability is proportional to the indirect utility. As can be seen on the left, when a new node $j$ attaches to node $i$ in the existing network, $k^{[1]}_i$ increases by one, in what is hereafter referred to as "direct growth" from the perspective of node $i$. At the same time, $k^{[2]}_{i_n}$ of each existing node $i_n$ adjacent to node $i$ increases by one. This increases the preference for nodes $i_n$, since $\Pi^{[2]}_{i_n}$ is proportional to $k^{[2]}_{i_n}$ in the $U_I$ model. The figure on the right explains the case where a new node $j$ attaches to $i_3$, which is one of the nodes $i_n$. This attachment corresponds to "indirect growth" from the point of view of node $i$: it increases $k^{[1]}_{i_3}$ and $k^{[2]}_i$ by one, and thus increases the preference for node $i$. In this way, a specific node $i$ and its adjacent nodes $i_n$ grow mutually by alternating direct growth and indirect growth. Such growth is hereafter referred to as "mutual growth".
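The alternation of direct and indirect growth can be traced on a toy network. The sketch below is illustrative only: the node labels and the helper names `attach` and `k2` are assumptions, and the network is a small star rather than the one drawn in Fig. 3a.

```python
def attach(adj, target, new):
    """Attach `new` to `target`; return the nodes whose k2 rises by one."""
    gained = set(adj[target])   # existing neighbours of the target
    adj[target].add(new)
    adj[new] = {target}
    return gained


def k2(adj, v):
    """Number of nodes at distance exactly 2 from v."""
    second = set()
    for n in adj[v]:
        second |= adj[n]
    return len(second - adj[v] - {v})


# Node i with three neighbours i1, i2, i3 (hypothetical labels).
adj = {"i": {"i1", "i2", "i3"}, "i1": {"i"}, "i2": {"i"}, "i3": {"i"}}

# Direct growth of i: a new node j attaches to i.
gained_direct = attach(adj, "i", "j")
state_after_direct = (len(adj["i"]), k2(adj, "i3"))

# Indirect growth of i: a new node j2 attaches to the neighbour i3.
gained_indirect = attach(adj, "i3", "j2")
state_after_indirect = (len(adj["i3"]), k2(adj, "i"))
```

After the first attachment every neighbour of `i` gains one indirect link, which raises its attachment weight under the $U_I$ rule. After the second attachment it is `i` itself that gains an indirect link. This is the feedback the text calls mutual growth.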
Figure 3b shows the patterns of direct growth and indirect growth of each node in an evolving network numerically simulated by the $U_R$, $U_D$, and $U_I$ models, respectively. Direct growth and indirect growth are set to $\Delta k^{[1]}_i(t) \equiv k^{[1]}_i(t) - k^{[1]}_{i,\mathrm{int}}$ and $\Delta k^{[2]}_i(t) \equiv k^{[2]}_i(t) - k^{[2]}_{i,\mathrm{int}}$ to eliminate the dependency on the initial values that occurs when node $i$ first enters the network, and the colour of the heat map indicates the number of nodes in that region. As Fig. 3b indicates, the $U_I$ model shows the strongest positive correlation between $k^{[1]}_i(t)$ and $k^{[2]}_i(t)$ among these models. In the $U_R$ model, the direct and indirect growth of nodes is relatively insignificant, and in the $U_D$ model, high indirect growth is observed in nodes with high direct growth, but no clear correlation across nodes as a whole can be seen. Our mathematical analysis in Supplementary Information S2 demonstrates that the scale of $k^{[2]}_i(t)$ of considerably grown nodes in the $U_R$, $U_D$, and $U_I$ models is approximately …, …, and $(k^{[1]}_i(t))^{1.5}$, respectively. In particular, in the $U_I$ model, two patterns of growth mechanism appear according to the direction of mutual growth (see Supplementary Information S2 for details). A node in which mutual growth of $k^{[1]}_i(t)$ and $k^{[2]}_i(t)$ dominates the growth mechanism leads the growth of outer areas ($k^{[l]}_i(t)$, $l > 2$) and becomes the centre of growth. This growth is hereafter referred to as "active growth". However, if the influence of the outer area $k^{[3]}_i(t)$ is dominant in the growth mechanism, mutual growth occurs in the opposite direction, where $k^{[3]}_i(t)$ leads the growth of $k^{[2]}_i(t)$ and then $k^{[2]}_i(t)$ leads the growth of $k^{[1]}_i(t)$. This growth is hereafter referred to as "passive growth". Therefore, excluding external influences $k^{[l]}_i(t)$ ($l > 2$), that is, considering only the growth of $k^{[1]}_i(t)$ and $k^{[2]}_i(t)$ of node $i$, the indirect preferential attachment leads to a scaling exponent of 1.5 (i.e., $k^{[2]}_i(t) \propto (k^{[1]}_i(t))^{1.5}$). Thus, mutual growth by the indirect preferential attachment is ideally expected to have a scaling exponent of 1.5, but a scaling exponent greater than 1.5 appears in the case of passive growth, which is attracted to external growth (the $U_I$ model in Fig. 3b). These distinct growth patterns, active growth and passive growth, explain the nodes that are the centre of growth and the nodes that propagate the growth in the network structure, respectively, and explain how the local mechanism of mutual growth between direct growth and indirect growth (Fig. 3a) leads to the emergence of a converged and hierarchical structure in which the growth of nodes propagates outward from the centre ($U_I$ model in Fig. 2a).
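One simple way to read a scaling exponent off paired growth data is an ordinary least-squares fit on log-log axes. The sketch below is an assumption about how such an exponent could be estimated. The paper's own fitting procedure is described in its Supplementary Information and is not reproduced here, and the function name and sample data are hypothetical.

```python
import math


def scaling_exponent(dk1, dk2):
    """Least-squares slope of log(dk2) against log(dk1).

    Pairs with a non-positive value are skipped because they have no logarithm.
    """
    pts = [(math.log(x), math.log(y)) for x, y in zip(dk1, dk2) if x > 0 and y > 0]
    n = len(pts)
    mean_x = sum(x for x, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    return sxy / sxx


# Synthetic check: data that follow dk2 = 3 * dk1 ** 1.5 exactly.
dk1 = [0, 1, 2, 4, 8, 16]
dk2 = [0.0] + [3 * x ** 1.5 for x in dk1[1:]]
alpha = scaling_exponent(dk1, dk2)
```

Skipping the non-positive pairs mirrors the exclusion of areas with no growth from the log-scale heat maps in Fig. 3b.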

The effect of indirect utility on the structure and growth of nodes in real social networks
The mathematical analysis and numerical simulations just described indicate that mutual growth between direct growth and indirect growth occurs with a scaling exponent of 1.5 in networks formed by preferential attachment based on indirect utility. However, observing mutual growth in real networks and interpreting it as growth by indirect utility can be challenging, since, even from the perspective of the utility-based preferential attachment model, the benefit and cost coefficients can vary depending on the environment in which the network is formed, and nodes with various preferences can coexist in evolving networks. In this regard, our mathematical analysis indicates that in the utility-based preferential attachment model with a preference for indirect utility (e.g., the $U_{M+}$ and $U_{M-}$ models), a node with active growth forms the scaling exponent of 1.5, regardless of the benefit and cost coefficients (see Supplementary Information S3). Further, we confirm the dominant effect of indirect utility in a network grown by sequentially attaching nodes with the preferential attachments of the $U_R$, $U_D$, and $U_I$ models, and also confirm the scaling exponent of 1.5 in sufficiently grown nodes (see Supplementary Information S4). From the point of view of the utility-based preferential attachment model, these robust results support the assumption that a preference based on indirect utility is present in the evolving mechanism of a network when the scaling exponent of mutual growth in sufficiently grown nodes is close to 1.5.
Figure 3. (a) When a new node $j$ is attached to node $i$, $k^{[1]}_i$ and $k^{[2]}_{i_n}$ increase, which is reflected in the indirect utility of node $i_n$, increasing the preference for node $i_n$. Similarly, when a new node $j$ is attached to node $i_n$, $k^{[1]}_{i_n}$ and $k^{[2]}_i$ increase, which is reflected in the indirect utility of node $i$, increasing the preference for node $i$. Therefore, a node $i$ and neighbouring nodes $i_n$ mutually grow by increasing each other's preference, and the relationship is established as $k^{[2]}_i(t) \propto (k^{[1]}_i(t))^{1.5}$ (see Supplementary Information S2). (b) A set of heat maps between $\Delta k^{[1]}_i(t)$ and $\Delta k^{[2]}_i(t)$, each of which shows the distribution of the direct and indirect growth of nodes in an evolving network simulated numerically by the $U_R$, $U_D$, and $U_I$ models. The horizontal and vertical arrows denote the orientation of the direct and indirect growth. The number of nodes in each area in the heat map is the average value of 20 numerical simulations ($t = 100{,}000$). The dashed lines are estimates based on mathematical analysis (see Supplementary Information S2). Areas with no direct or indirect growth are excluded for convenience on the log scale.
To confirm the existence of a preference based on indirect utility in social networks, we investigated the scaling exponent of the indirect growth $\Delta k^{[2]}_i(t)$ of each node with respect to the direct growth $\Delta k^{[1]}_i(t)$ in two real temporal social networks, YouTube and Facebook, which were well observed over time [36,37]. The data on the YouTube network record the friendships that grew over 165 days, and the data on the Facebook network record the comment activities on the wall pages of New Orleans users over 850 days.
Figure 4a,b show the results of tracking $\Delta k^{[1]}_i(t)$ and $\Delta k^{[2]}_i(t)$ of each node in the YouTube and Facebook networks, respectively. As confirmed in the numerical simulation and mathematical analysis, the 1.5 scaling due to mutual growth is evident in the sufficiently grown nodes with active growth. Therefore, we coloured the top 50 nodes (blue) in descending order of $k^{[1]}_i$, and the local hubs (red), which have a high $k^{[1]}_i$ compared to neighbouring nodes and are likely to be in an active growth state. The dashed lines represent the growth of nodes with a scaling exponent of 1.5 in the relationship between $\Delta k^{[1]}_i(t)$ and $\Delta k^{[2]}_i(t)$. In the YouTube network, the scaling exponent of the local hubs has a value lower than 1.5 while, in the Facebook network, the value is close to 1.5. The top 50 nodes in each social network show a trend similar to that of the scaling exponent for the local hubs. This result indicates the possibility that some indirect utility relates deeply to the major mechanism of the growth of the nodes in the Facebook network, whereas other mechanisms may explain the growth of nodes in the YouTube network. Figure 4c,d present heat maps of the relationship between $k^{[1]}_i$ and $k^{[2]}_i$ of each node entering the YouTube and Facebook networks, respectively. Figure 4d clearly shows that the indirect links in the Facebook network grow with a scaling exponent of 1.5, a result similar to that of the $U_I$ model (Fig. 3b). However, no sign of such growth is observable in the YouTube network shown in Fig. 4c. This result suggests that there is mutual growth between the direct and indirect growth in the Facebook network.
Figure 4. (a,b) The results of tracking the direct growth, $\Delta k^{[1]}_i(t)$, and indirect growth, $\Delta k^{[2]}_i(t)$, of each node in the YouTube and Facebook networks. Local hubs (red) are nodes with $k^{[1]}_i$ larger than that of the neighbouring nodes in the final state, and Top50 (blue) corresponds to the 50 nodes with the highest $k^{[1]}_i$ in the final state. The dashed line represents the relationship $k^{[2]}_i(t) \propto (k^{[1]}_i(t))^{1.5}$, which is the theoretical estimate of the mutual growth mechanism (see Supplementary Information S2). (c,d) Heat maps of the relationship between $\Delta k^{[1]}_i(t)$ and $\Delta k^{[2]}_i(t)$, which show the distribution of the direct and indirect growth of nodes in the YouTube and Facebook networks in the final state (same as Fig. 3b).
Consideration of the meaning of the links in each network is necessary to interpret these results. The friendship links observed on the YouTube network require the mutual consent of each user. However, since this relationship is not publicly revealed to others, its indirect utility may be difficult to observe. In the Facebook network, by contrast, the comments on a user's wall are publicly exposed to others. Therefore, posting a comment on the wall naturally increases a user's exposure to others and the probability that they will visit and comment on the wall. The formation of links triggered by this indirect exposure can be seen as preferential attachment based on indirect utility and suggests the possibility that the direct and indirect growth of nodes affect each other mutually.
Next, we consider the scaling exponent of $k^{[2]}_i$ of each node with respect to $k^{[1]}_i$ in real static social networks using various categories of datasets. Unlike in temporal networks, the scaling exponent of $k^{[2]}_i$ of each node with respect to $k^{[1]}_i$ cannot be observed directly over time in static networks, but we roughly approximate the growth scale using the fact that, as Figs. 3b and 4 show, the influence of the scaling exponent becomes dominant in the local hubs. Specifically, we estimate the exponent $\alpha$ in $\Delta k^{[2]}_i \propto (\Delta k^{[1]}_i)^{\alpha}$ for the local hubs satisfying $k^{[1]}_i > 50$. From the Stanford large network dataset collection, we select 34 undirected and unweighted networks, the categories of which are classified as autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks) [38]. The selection criteria targeted categories grouped into the same type, and only cases where the category contained three or more undirected and unweighted network samples were selected (see "Methods" for details).
Figure 5a shows the averaged value of $\alpha$ calculated from the local hubs of each of the various networks. The social networks in social media (deezer, facebook, physical location-based online social media, Last.fm) and the collaboration networks between researchers show a large value of $\alpha$, while the networks of Wikipedia, autonomous systems, and some social networks (github, twitch) show a small value of $\alpha$. We are unable to completely explain the complex growth mechanism of these real networks, but the preferential attachment based on indirect utility provides a partial explanation for values of $\alpha$ ranging from 1 to 1.5. In the networks in which social interactions take place, as described above, local hubs have a value of $\alpha$ close to 1.5, indicating that the local hubs in the social networks grow not alone but together with the surrounding nodes and suggesting that preferential attachment based on indirect utility may have contributed to this growth.
Further, we investigated the relationship between the scaling exponent $\alpha$ and two representative indicators of the network structure, the "assortativity coefficient" and the "clustering coefficient". The assortativity coefficient $r$ indicates the correlation between the degree of a node and the degrees of its neighbouring nodes across the network [39]. The clustering coefficient $C$ indicates the average, over all nodes in the network, of the ratio of closed triplets among the possible triplets for each node. Previous studies indicate that there is a correlation between $r$ and $C$ [40,41]. As Fig. 5c shows, there is a strong correlation between $r$ and $C$, with a Pearson's correlation coefficient of 0.77. However, after exclusion of the three networks with exceptionally large values of $r$, the coefficient decreases to 0.36. By contrast, as Fig. 5b clearly shows, with the exclusion of the three networks, the Pearson's correlation coefficient between $r$ and $\alpha$ increases from 0.44 to 0.82. The three networks taking very large values of $r$ and $C$ suggest that the clustering of the nodes around the hubs is reflected in the large values of $r$. Meanwhile, as Fig. 5d shows, there is no significant relationship between $\alpha$ and $C$.
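The two structural indicators can be computed from an adjacency structure with a short, dependency-free sketch. It assumes the common conventions: $r$ as the Pearson correlation of the degrees at the two ends of every link (each link counted in both directions), and $C$ as the mean local clustering coefficient with nodes of degree below 2 contributing zero. The paper's exact conventions are given in its Methods and may differ, and the function names are hypothetical.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation coefficient; NaN if either variable is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)


def assortativity(adj):
    """Degree correlation across links, counting every link in both directions."""
    xs, ys = [], []
    for u in adj:
        for v in adj[u]:
            xs.append(len(adj[u]))
            ys.append(len(adj[v]))
    return pearson(xs, ys)


def average_clustering(adj):
    """Mean over nodes of (closed neighbour pairs) / (all neighbour pairs)."""
    total = 0.0
    for v, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            continue   # assumption: such nodes contribute zero
        closed = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += closed / (k * (k - 1) / 2)
    return total / len(adj)


star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}                 # hub with three leaves
triangle_tail = {0: {1, 2, 3}, 1: {0, 2}, 2: {0, 1}, 3: {0}}  # triangle plus one leaf
r_star = assortativity(star)
c_tail = average_clustering(triangle_tail)
```

A star is perfectly disassortative because every link joins the hub to a leaf, whereas a positive $r$ requires well-connected nodes to link to other well-connected nodes.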
It is well known that the relationship between $r$ and $C$ appears as a positive correlation in social networks [39], and it has been suggested that community structure and clustering can have a positive effect on degree assortativity [40-42]. However, the results reported above indicate that the effect of $\alpha$ can explain the positivity of $r$ in a way that is different from, and even better than, the effect of $C$. Accordingly, the mutual growth pattern produced by preferential attachment based on indirect utility may be an important growth mechanism behind the positive degree assortativity in social networks.

Discussion
Preferential attachment has become an important mechanism for understanding the evolution of social networks [3-8]. However, compared to the interest in the mechanism by which such a preference can occur in a physical sense [9-14], there has been a lack of interest in a framework that deals with preferences in a cognitive sense, especially regarding indirect utility [15-22]. In this study, we have introduced the utility-based preferential attachment model by focusing on social preference in human cognition, investigating in particular the effect of preferential attachment based on indirect utility on the structure and growth of evolving networks. Our numerical simulations and mathematical analysis demonstrated that, in evolving networks, converged and hierarchical structures in which the growth of nodes propagates outward from the centre stably emerge through preferential attachment based on indirect utility. In addition, we showed that this growth mechanism promotes the mutual growth of direct growth and indirect growth with a scaling exponent of 1.5 from the perspective of each node, and observed the mutual growth patterns in real networks, especially social networks.
The main finding of our numerical simulations and mathematical analyses is the converged and hierarchical structures (see Fig. 2a) formed by the influx of nodes that prefer nodes with high indirect utility. This structure is a robust outcome confirmed not only in a model taking into consideration only indirect utility (the $U_I$ model), but also in models taking into consideration both the direct and indirect utilities (e.g., the $U_{M+}$ and $U_{M-}$ models, see Supplementary Information S3), and a model in which nodes with different preferences are mixed (see Supplementary Information S4). Our work reveals how preferential attachment based on indirect utility forms a microscopic mutual growth mechanism (see Fig. 3a), and how this mechanism differentiates into active growth and passive growth and leads to the emergence of the converged and hierarchical structure. Therefore, in a society where there is a continuous influx of people who prefer indirect utility, this structure can emerge deterministically.
In the utility-based preferential attachment model, utility serves not only as the basis of the preference for attachment to nodes in the network, but also as the utility that each node can obtain from the network. In an evolving network, the total indirect utility increases in proportion to the number of direct links of the node to which a new node attaches. Thus, a preference proportional to the number of direct links explicitly means a preference for increasing indirect utility. However, despite the absence of an explicit relationship to increasing indirect utility, preferential attachment based on indirect utility rapidly increases the indirect utility of the whole network, which over time exceeds the total indirect utility of the network formed by preferential attachment based on direct utility (see Fig. 2c). The rapid growth is an outcome not of the preference of nodes for indirect utility per se, but of the mechanism of mutual growth between adjacent nodes that the preference induces and of the converged and hierarchical structure that emerges from it. This increasing pattern of utility is likely the underlying mechanism by which people increase utility in large-scale evolving networks such as cities [43,44], especially considering that the number of direct connections of people can be finite under cognitive constraints [45,46].
In the conventional preferential attachment models, the growth of a node depends on the elapsed time after entering the network [3] or on the fitness of individual nodes [7]. Therefore, it is common not to associate preference with the growth of a node, and preference has been understood as an intrinsic property of each node or as a physical mechanism [9-14]. However, in preferential attachment based on indirect utility, attachment by preference and the outcome of attachment are mutually dependent. For example, a node attached near the central area of a converged and hierarchical structure grows faster than nodes that are not. If the outcome of attachment by a preference leads to greater growth, a feedback loop that reinforces the preference can form. The analysis of a network model grown by sequentially attaching nodes with the preferential attachments of the $U_R$, $U_D$, and $U_I$ models also supports the view that nodes with a preference for indirect utility are more advantaged in the growth of direct and indirect links than nodes with no preference or nodes with a preference for direct utility (see Supplementary Information S4). Therefore, preferences for indirect utility are not just an intrinsic property, but a property that can be stably formed by preferential attachment and feedback from its outcomes. It is meaningful to consider this dependent growth, and the possibility that indirect preferences form against this background, in social networks. There is dependent growth in the formation of networks between people. For example, popularity among peers in adolescence has a contagious effect [47,48], and research on weak tie theory and structural holes shows that links that expand indirect connections could be a source of growth [18-20]. Under these environmental conditions, people do not simply choose nodes at random or nodes that have a high probability of being connected. People adaptively and strategically evaluate the utility of the nodes with which to form connections, and also pursue their own growth through the choice. Our model shows that this dependent growth mechanism can emerge simply from preferential attachment based on indirect utility.
The strong correlation between the scaling exponent of indirect growth and degree assortativity (Fig. 5b) opens up new possibilities for explaining the positive degree assortativity that appears in evolving social networks [40]. The conventional interpretations of the origin of degree assortativity have focused on the clustering effect [40-42,49], since the more open triads are closed, the more clusters of nodes with similar degree are formed. However, the degree of a node adjacent to node $i$ includes not only the links of clusters formed with other nodes adjacent to node $i$, but also the links formed outside the nodes adjacent to node $i$. The scaling exponent of indirect growth of a node can be described as an indicator independent of the clustering coefficient in that it excludes the links between the adjacent nodes of node $i$. The fact that this indicator correlates strongly with the degree assortativity indicates that the mutual growth mechanism arising from preferential attachment based on indirect utility may contribute to the positivity of the degree assortativity, which is an important characteristic of social networks.
One of the major limitations of the utility-based preferential attachment model is that preferential attachment requires computing the utility of every node in the network. The assumption that a new node can evaluate its preference for all nodes is unrealistic, and conventional preferential attachment models have addressed feasibility through models based on local mechanisms [9][10][11][12][13][14]. For preferential attachment based on indirect utility, a local mechanism can be devised in which a new node first selects a node at random, then moves at random to a node at a shortest distance of 2 from it, and forms a link there, so that the effects of indirect utility are obtained. In this case, the mutual growth mechanism around a node, which is itself a local growth mechanism, would work in the same way as in the utility-based preferential attachment model, but the size or formation dynamics of the macroscopic structure may differ. For example, our numerical simulations form converged and hierarchical structures around very few dominant nodes in active growth, whereas under a local growth mechanism such structures could form around many actively growing nodes throughout the network. Preferences for indirect utility can also be considered in connections between existing nodes, as a realistic extension. A recent study using a game-theoretical framework shows that selection to increase betweenness centrality forms an ultra-small-world network 50. Preferences for indirect utility are likely to make a similar contribution, as they could be a practical way to increase betweenness centrality. With these extensions, future studies can construct more realistic models. In addition, the definition of the local hub (i.e., a node with a higher degree than its neighbouring nodes), introduced to estimate the active growth state, needs to be refined. Future studies could try to improve the accuracy with which mutual growth is observed through strategies such as removing nodes with a larger degree among neighbouring nodes, or nodes showing unusual growth patterns. Moreover, the growth mechanism may not be stable throughout the entire process of network formation. For example, although the evidence is rough, in the YouTube friendship network (see Fig. 4c) the growth pattern of the nodes with prominent growth (Top50) appears to switch to a pattern with a different scaling exponent around $k^{[1]} \approx 100$. In this way, the growth mechanism of real networks may vary with the growth scale of nodes, and future studies can consider these heterogeneous growth patterns. Lastly, in this study, we defined the most basic form of utility function, which is linearly proportional to the number of directly connected nodes and indirectly connected nodes. However, the model may need to be adjusted depending on the context of the network under investigation. For example, in a network where the number of human relationships is limited by the cognitive limits of the nodes, such as Dunbar's number 45,46, a non-linear function in which utility saturates as the number of nodes increases can be considered. Depending on the nature of the utility or the perception of the utility, there may be cases where the benefit or cost is fixed and not proportional to the number of links 51. Future research could investigate the variation and robustness of network structural evolution and mutual growth scaling by adjusting the model to different contexts.
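The local mechanism mentioned above (select a node at random, then move to a random node at shortest distance 2 and attach there) could be sketched as follows. This is only an illustration under stated assumptions: the two-node seed, the fallback of attaching to the start node when no node lies at distance 2, and the function names are all hypothetical and not taken from the model definition.

```python
import random

def nodes_at_distance_two(adj, start):
    """Nodes whose shortest-path distance from `start` is exactly 2."""
    first = set(adj[start])
    second = set()
    for nb in first:
        for nn in adj[nb]:
            if nn != start and nn not in first:
                second.add(nn)
    return second

def grow_locally(steps, seed=0):
    """Grow a tree with one link per new node using only local information."""
    rng = random.Random(seed)
    adj = {0: [1], 1: [0]}                      # assumed seed: a single link
    for new in range(2, steps + 2):
        start = rng.choice(list(adj))           # randomly selected starting point
        candidates = sorted(nodes_at_distance_two(adj, start))
        # Assumed fallback: attach to the start node if nothing is at distance 2.
        target = rng.choice(candidates) if candidates else start
        adj[target].append(new)
        adj[new] = [target]
    return adj

path = {0: [1], 1: [0, 2], 2: [1]}
reachable_from_end = nodes_at_distance_two(path, 0)
grown = grow_locally(100)
```

No global utility sum is computed here; each step touches only the second neighbourhood of the starting node, which is what makes the mechanism local.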
In summary, our study showed growth mechanisms and structural features that emerge from the influx of nodes that prefer indirect utility. These results contribute to the understanding of the evolving mechanisms of social networks, and the methodologies can be widely applied to the investigation of microscopic and macroscopic growth patterns of evolving networks.

Utility-based preferential attachment model
We propose as a mathematical model of evolving networks the utility-based preferential attachment model

$$\frac{d}{dt}k_i^{[1]}(t) = m\,\Pi_i^{[n]}(t),$$

where

$$\Pi_i^{[n]}(t) = \frac{u_i^{[n]}(t)}{\sum_{j=1}^{t} u_j^{[n]}(t)}, \qquad u_i^{[n]}(t) = \sum_{l=1}^{n} b_{i(l)}\,k_i^{[l]}(t) - c_i\,k_i^{[1]}(t).$$

Here, $\Pi_i^{[n]}(t)$ is the preferential attachment probability, normalized as $\sum_{j=1}^{t} \Pi_j^{[n]}(t) = 1$, and $u_i^{[n]}(t)$ is the utility function, at time $t\,(\geq t_i)$, of node $i$, which was attached to a node on the existing network at time $t_i$. $u_i^{[n]}(t)$ represents the utility that node $i$ obtains from all of the nodes within the $n$th degree of separation on the network. The first term on the right side of this function, $\sum_{l=1}^{n} b_{i(l)}\,k_i^{[l]}(t)$, denotes the benefit that node $i$ obtains from all of the nodes within the $n$th degree of separation from it, and the second term, $c_i\,k_i^{[1]}(t)$, denotes the cost that node $i$ needs to pay to connect directly to the nodes at the 1st degree of separation. Here, $c_i$ is a parameter with a positive value representing the cost that node $i$ needs to pay to connect to a node at the 1st degree of separation from node $i$. $k_i^{[l]}(t)$ is the number of links at the $l$th degree of separation from node $i$ at time $t\,(\geq t_i)$. $b_{i(l)}$ denotes a function with a positive value representing the benefit that node $i$ obtains from a node at the $l$th degree of separation. Also, $m$ is the number of links attaching a new node to existing nodes.

$u_i^{[n]}(t)$ is also rewritten as

$$u_i^{[n]}(t) = u_i^{[n][D]}(t) + u_i^{[n][I]}(t),$$

where

$$u_i^{[n][D]}(t) = \bigl(b_{i(1)} - c_i\bigr)\,k_i^{[1]}(t), \qquad u_i^{[n][I]}(t) = \sum_{l=2}^{n} b_{i(l)}\,k_i^{[l]}(t).$$

Here, $u_i^{[n][D]}(t)$ is the direct utility term, that is, the utility that node $i$ obtains from all of the nodes at the 1st degree of separation, and $u_i^{[n][I]}(t)$ is the indirect utility term, that is, the utility that node $i$ obtains from all of the nodes between the 2nd and $n$th degrees of separation.
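A minimal simulation sketch of this kind of model is given below for the special case of two degrees of separation and one link per new node, so that the network stays a tree. It assumes node-independent coefficients: `g` stands for the net coefficient of direct links (benefit minus cost) and `b2` for the benefit per node at distance 2, both non-negative. The two-node seed, the uniform fallback when every utility is zero, and the function name are illustrative assumptions rather than the authors' implementation.

```python
import random

def grow_network(steps, g=0.0, b2=1.0, seed=0):
    """Utility-based preferential attachment on a tree (n = 2, m = 1).

    Attachment probability of node i is proportional to g*k1[i] + b2*k2[i],
    where k1 and k2 count nodes at distance 1 and 2. Assumes g, b2 >= 0.
    """
    rng = random.Random(seed)
    adj = {0: [1], 1: [0]}            # assumed seed network: a single link
    k1 = {0: 1, 1: 1}
    k2 = {0: 0, 1: 0}
    for new in range(2, steps + 2):
        nodes = list(adj)
        weights = [g * k1[i] + b2 * k2[i] for i in nodes]
        if sum(weights) > 0:
            target = rng.choices(nodes, weights=weights)[0]
        else:
            target = rng.choice(nodes)  # assumed fallback when all utilities vanish
        # On a tree, the old neighbours of the target gain one indirect link each,
        # and the new node sees exactly those neighbours at distance 2.
        for nb in adj[target]:
            k2[nb] += 1
        k2[new] = k1[target]
        k1[new] = 1
        k1[target] += 1
        adj[target].append(new)
        adj[new] = [target]
    return adj, k1, k2

adj, k1, k2 = grow_network(200, g=0.0, b2=1.0)
```

Setting `b2=0` with `g>0` makes the attachment depend on degree only, while `g=0` with `b2>0` makes it depend on indirect links only; the incremental updates avoid recomputing distances after every attachment.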

and U M-models
In what follows, we set $c_i$ for arbitrary node $i$ to the same value $c$ and $b_{i(l)}$ for arbitrary node $i$ to the same function $b_{(l)}$ for simplicity. Notably, $b_{(l)}$ is written simply as $b_l$. We restrict the distance between nodes here to $n = 2$. Thus, the utility-based preferential attachment model is reduced to

$$\frac{d}{dt}k_i^{[1]}(t) = m\,\Pi_i^{[2]}(t),$$

where

$$\Pi_i^{[2]}(t) = \frac{u_i^{[2]}(t)}{\sum_{j=1}^{t} u_j^{[2]}(t)},$$

$$u_i^{[2]}(t) = u_i^{[2][D]}(t) + u_i^{[2][I]}(t) = g(c, b_1)\,k_i^{[1]}(t) + b_2\,k_i^{[2]}(t), \tag{12}$$

with $g(c, b_1) = b_1 - c$ the net coefficient of the direct links given by the benefit and cost terms. Then, $\Pi_i^{[2]}(t)$ is described as

$$\Pi_i^{[2]}(t) = \frac{g(c, b_1)\,k_i^{[1]}(t) + b_2\,k_i^{[2]}(t)}{\sum_{j=1}^{t}\bigl[g(c, b_1)\,k_j^{[1]}(t) + b_2\,k_j^{[2]}(t)\bigr]}, \tag{14}$$

and Eqs. (11) and (12) show that the utility-based preferential attachment model with $n = 2$ is described as

$$\frac{d}{dt}k_i^{[1]}(t) = m\,\frac{g(c, b_1)\,k_i^{[1]}(t) + b_2\,k_i^{[2]}(t)}{\sum_{j=1}^{t}\bigl[g(c, b_1)\,k_j^{[1]}(t) + b_2\,k_j^{[2]}(t)\bigr]}$$

(see Supplementary Information S3). This model is hereafter referred to as the "utility-based preferential attachment model" for simplicity. In this study, we analyse the case of $m = 1$.

The $U_D$ model represents the situation satisfying $u_i^{[2]}(t) = u_i^{[2][D]}(t)$, which is realized under the condition $b_2 = 0$, and the substitution of this condition into Eqs. (13) and (14) shows that $\Pi_i^{[2]}(t)$ becomes

$$\Pi_i^{[2]}(t) = \frac{k_i^{[1]}(t)}{\sum_{j=1}^{t} k_j^{[1]}(t)}.$$

This is the same preferential attachment probability as in the BA model, which means that when only direct utility is considered in the preference, the utility-based preferential attachment model reduces to the BA model. Then, the substitution of $b_2 = 0$ into Eqs. (15) and (16) shows that the $U_D$ model is mathematically described as

$$\frac{d}{dt}k_i^{[1]}(t) = m\,\frac{k_i^{[1]}(t)}{\sum_{j=1}^{t} k_j^{[1]}(t)}.$$

The $U_I$ model represents the situation satisfying $u_i^{[2]}(t) = u_i^{[2][I]}(t)$, which is realized under the condition $b_1 = c$, and substituting this condition into Eqs. (13) and (14) indicates that $\Pi_i^{[2]}(t)$ becomes

$$\Pi_i^{[2]}(t) = \frac{k_i^{[2]}(t)}{\sum_{j=1}^{t} k_j^{[2]}(t)}.$$

Then, the substitution of $b_1 = c$ into Eqs. (15) and (16) shows that the $U_I$ model is mathematically described as

$$\frac{d}{dt}k_i^{[1]}(t) = m\,\frac{k_i^{[2]}(t)}{\sum_{j=1}^{t} k_j^{[2]}(t)}.$$

The $U_{M+}$ model represents the situation satisfying $u_i^{[2]}(t) = k_i^{[1]}(t) + k_i^{[2]}(t)$, which is realized under the condition $b_1 = c + 1$, $b_2 = 1$, and the substitution of this condition into Eqs. (13) and (14) shows that $\Pi_i^{[2]}(t)$ becomes

$$\Pi_i^{[2]}(t) = \frac{k_i^{[1]}(t) + k_i^{[2]}(t)}{\sum_{j=1}^{t}\bigl[k_j^{[1]}(t) + k_j^{[2]}(t)\bigr]}.$$
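The growth of the indirect links can be sketched from the attachment probability alone. The derivation below is a reading under stated assumptions, not a quotation of Supplementary Information S3: it takes $m = 1$, so that the network is a tree, writes $N_i$ for the set of neighbours of node $i$ (notation introduced here), and uses the fact that $k_i^{[2]}$ grows by one whenever the new node attaches to a neighbour of $i$.

```latex
\begin{align}
\frac{d}{dt}k_i^{[2]}(t)
  &= \sum_{j \in N_i} \Pi_j^{[2]}(t)
   = \frac{g(c,b_1)\sum_{j \in N_i} k_j^{[1]}(t) + b_2 \sum_{j \in N_i} k_j^{[2]}(t)}
          {\sum_{j=1}^{t}\bigl[g(c,b_1)\,k_j^{[1]}(t) + b_2\,k_j^{[2]}(t)\bigr]}, \\
\sum_{j \in N_i} k_j^{[1]}(t) &= k_i^{[1]}(t) + k_i^{[2]}(t), \\
\sum_{j \in N_i} k_j^{[2]}(t) &= k_i^{[1]}(t)\bigl(k_i^{[1]}(t) - 1\bigr) + k_i^{[3]}(t).
\end{align}
```

The second line holds on a tree because each neighbour of $i$ is linked to $i$ itself and to its own share of the nodes at distance 2 from $i$; the third holds because each neighbour sees the other $k_i^{[1]} - 1$ neighbours of $i$ and its own share of the nodes at distance 3 from $i$ at distance 2. With $g(c, b_1) = 1$ and $b_2 = 1$ the numerator becomes $\bigl(k_i^{[1]}\bigr)^2 + k_i^{[2]} + k_i^{[3]}$, which agrees with the form of Eq. (24). The quadratic dependence on $k_i^{[1]}$ comes entirely from the indirect utility of the neighbours, which is one way to see why neighbouring nodes reinforce each other's growth.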

The criteria for selecting a static social network in Stanford large network dataset collection
The Stanford large network dataset collection 38 divides network datasets into 23 categories. Our selection criteria are as follows: (1) a category in which the properties of nodes can be grouped into the same type, (2) a category containing at least three network samples, so that general properties of the category can be discussed, (3) an unweighted and undirected network, as in our mathematical model, and (4) a network in which the local hub's degree has grown sufficiently to meet the analysis criterion ($k_i^{[1]} > 50$). The categories that met our criteria are "autonomous systems", "social networks", "location-based online social networks", "collaboration networks", and "Wikipedia", of which "social networks" and "location-based online social networks" are classified together as the "social networks" type. Two networks are excluded as exceptions. The "ego-Facebook" network in the "social networks" category is not of interest because it is a dataset about ego networks, and the "as-Skitter" network in "autonomous systems" is excluded because it is too large to analyse. As a result, 34 networks met our criteria: autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks).

$$\frac{d}{dt}k_i^{[2]}(t) = \frac{\bigl(k_i^{[1]}(t)\bigr)^2 + k_i^{[2]}(t) + k_i^{[3]}(t)}{\sum_{j=1}^{t}\bigl[k_j^{[1]}(t) + k_j^{[2]}(t)\bigr]} \tag{24}$$
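Criterion (4) can be checked with a few lines of code. The sketch below assumes that a local hub is a node whose degree is strictly greater than that of every neighbour, and that the network is given as an adjacency dictionary; the function name, the `min_degree` parameter, and the toy graph are hypothetical.

```python
def local_hubs(adj, min_degree=50):
    """Nodes whose degree exceeds min_degree and the degree of every neighbour."""
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    return [
        v for v, nbrs in adj.items()
        if degree[v] > min_degree and all(degree[v] > degree[w] for w in nbrs)
    ]

# Toy graph (assumption): a star centred on node 0 with one extra pendant node 5.
toy = {0: {1, 2, 3, 4}, 1: {0}, 2: {0}, 3: {0}, 4: {0, 5}, 5: {4}}
hubs_small_threshold = local_hubs(toy, min_degree=2)
hubs_default_threshold = local_hubs(toy)
```

A network would pass the criterion only if the list returned with the default threshold is non-empty; on the toy graph it is empty, since no node has more than 50 links.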

Figure 2. Effect of indirect utility on the structure of the networks formed by the utility-based preferential attachment model. (a) Illustrations of the networks formed by the $U_R$, $U_D$, and $U_I$ models at $t = 5000$. Red marks the local hubs, which are nodes with $k_i^{[1]}$ greater than that of the surrounding nodes, and the other nodes are coloured progressively bluer with distance from the nearest local hub. In the $U_I$ model, a converged and hierarchical structure forms in which many nodes grow sequentially around a very large local hub. (b,c) The distribution $P(k^{[1]})$ and the growth of the ratio $k^{[2]}(t) \bigl(\equiv \sum_{j=1}^{t} k_j^{[2]}(t) / \sum_{j=1}^{t} k_j^{[1]}(t)\bigr)$ in the utility-based preferential attachment models. The solid line represents the average of 20 numerical simulations ($t = 100{,}000$), and the error bar denotes the standard deviation.

Figure 3. A growth mechanism of the $U_I$ model and the relationship between direct and indirect growth in a network formed by the $U_R$, $U_D$, and $U_I$ models. (a) A growth mechanism for the networks formed by the $U_I$ model. When a new node $j$ is attached to node $i$, $k_i^{[1]}$ and $k_{i_n}^{[2]}$ increase, which is reflected in the indirect utility of node $i_n$, increasing the preference for node $i_n$. Similarly, when a new node $j$ is attached to node $i_n$, $k_{i_n}^{[1]}$ and $k_i^{[2]}$ increase, which is reflected in the indirect utility of node $i$, increasing the preference for node $i$. Therefore, a node $i$ and neighbouring nodes $i_n$ mutually grow through increase each

Figure 4. Relationship between the direct and indirect growth of nodes in real temporal social networks. (a,b) The results of tracking the direct growth, $\Delta k_i^{[1]}(t)$, and indirect growth, $\Delta k_i^{[2]}(t)$, of each node in the YouTube and Facebook networks. Local hubs (red) are nodes with $k_i^{[1]}$ larger than that of the neighbouring nodes in the final state, and Top50 (blue) corresponds to the 50 nodes with the highest $k_i^{[1]}$ in the final state. The dashed line represents the relationship $k_i^{[2]}(t) \propto (k_i^{[1]}(t))$

https://doi.org/10.1038/s41598-023-48827-6

Figure 5. Averaged scaling exponent of the indirect growth of local hubs with respect to the direct growth, and the relationship among the scaling exponent, clustering coefficient, and degree assortativity coefficient in real static networks. (a) The average scaling exponent $\alpha$ calculated from the local hubs in each of 34 undirected and unweighted networks, whose categories are autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks). (b-d) The correlation between the scaling exponent $\alpha$ and the degree assortativity coefficient $r$, between the clustering coefficient $C$ and $r$, and between $\alpha$ and $C$. The grey line represents the correlation over all networks, and the blue line represents the correlation excluding networks with exceptionally large values of $r$ and $C$. The value in the legend is Pearson's correlation coefficient. The colour of each dot matches the categories identified in (a).

www.nature.com/scientificreports/